RAW SEQUENCE LISTING 



The Biotechnology Systems Branch of the Scientific and Technical 
Information Center (STIC) no errors detected. 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: TJS/10/031, 496D 



DATE: 12/02/2004 
TIME: 14:41:47 



Input Set : A:\seq 4 99-45.ST25.txt 
Output Set: N:\CRF4\12022004\J031496D.raw 

3 <110> APPLICANT: National Renewable Energy Laboratory 

5 <120> TITLE OF INVENTION: Cellobiohydrolase I Gene and Improved Variants 
7 <13 0> FILE REFERENCE: NREL 99-45 
C--> 9 <140> CURRENT APPLICATION NUMBER: US/10/03 1 , 496D 



c- - > y 


<141> 


CURRENT FILING DATE: 2 002-01- 


14 


q 


<160> 


NUMBER OFSEQ ID NOS : 96 




i i 


<170> 


SOFTWARE: Patentln version 3. 


3 


1 O 


<210> 


SEQ ID NO: 1 




14 


<211> 


LENGTH: 2 8 




lb 


<212> 


TYPE: DNA 




lb 


<213> 


ORGANISM: Artificial 




1 ft 


<220> 


FEATURE : 




1 Q 

i y 


<223> 


OTHER INFORMATION: Primer for 


PCR 


Z 1 


<400> 


SEQUENCE: 1 




99 


agagagtcta gacacggagc ttacaggc 




9 R 

Z* Zj 


<210> 


SEQ ID NO: 2 




9 

Z. D 


<211> 


LENGTH: 3 5 




27 


<212> 


TYPE: DNA 




28 


<213> 


ORGANISM: Artificial 




30 


<220> 


FEATURE : 




31 


<223> 


OTHER INFORMATION: Primer for 


PCR 


33 


<400> 


SEQUENCE: 2 




34 


aaagaagcgc ggccgcgcct gcactctcca atcgg 


37 


<210> 


SEQ ID NO: 3 




38 


<211> 


LENGTH: 24 




39 


<212> 


TYPE: DNA 




40 


<213> 


ORGANISM: Artificial 




42 


<220> 


FEATURE : 




43 


<223> 


OTHER INFORMATION: Primer for 


PCR 


45 


<400> 


SEQUENCE : 3 




46 


ggcggaaacc cgcctggcac cacc 




49 


<210> 


SEQ ID NO: 4 




50 


<211> 


LENGTH: 1551 




51 


<212> 


TYPE: DNA 




52 


<213> 


ORGANISM: Trichoderma reesei 




55 


<220> 


FEATURE : 




56 


<221> 


NAME /KEY : CDS 




57 


<222> 


LOCATION: (1) . . (1545) 




59 


<220> 


FEATURE : 




60 


<221> 


NAME/KEY : misc_f eature 




61 


<222> 


LOCATION: (1) . . (51) 




63 


<220> 


FEATURE : 




64 


<221> 


NAME/KEY : misc_f eature 
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65 <222> LOCATION: (52) . . (1344) 

67 <220> FEATURE: 

68 <221> NAME /KEY : miscjbinding 

69 <222> LOCATION: (1436) .. (1550) 

71 <400> SEQUENCE: 4 

72 atg tat egg aag ttg gec gtc ate teg gee ttc ttg gec aca get cgt 

73 Met Tyr Arg Lys Leu Ala Val lie Ser Ala Phe Leu Ala Thr Ala Arg 

74 1 5 10 15 

76 get cag teg gec tgc act etc caa teg gag act cac ccg cct ctg aca 

77 Ala Gin Ser Ala Cys Thr Leu Gin Ser Glu Thr His Pro Pro Leu Thr 

78 20 25 30 

80 tgg cag aaa tgc teg tct ggt ggc acg tgc act caa cag aca ggc tec 

81 Trp Gin Lys Cys Ser Ser Gly Gly Thr Cys Thr Gin Gin Thr Gly Ser 

82 35 40 45 

84 gtg gtc ate gac gec aac tgg cgc tgg act cac get acg aac age age 

85 Val Val He Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser 

86 50 55 60 

88 acg aac tgc tac gat ggc aac act tgg age teg acc eta tgt cct gac 

89 Thr Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp 

90 65 70 75 80 

92 aac gag acc tgc gcg aag aac tgc tgt ctg gac ggt gec gec tac gcg 

93 Asn Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala 

94 85 90 95 

96 tec acg tac gga gtt acc acg age ggt aac age etc tec att ggc ttt 

97 Ser Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser He Gly Phe 

98 100 105 110 

100 gtc acc cag tct gcg cag aag aac gtt ggc get cgc ctt tac ctt atg 

101 Val Thr Gin Ser Ala Gin Lys Asn Val Gly Ala Arg Leu Tyr Leu Met 

102 115 120 125 

104 gcg age gac acg acc tac cag gaa ttc acc ctg ctt ggc aac gag ttc 

105 Ala Ser Asp Thr Thr Tyr Gin Glu Phe Thr Leu Leu Gly Asn Glu Phe 

106 130 135 140 

108 tct ttc gat gtt gat gtt teg cag ctg ccg tgc ggc ttg aac gga get 

109 Ser Phe Asp Val Asp Val Ser Gin Leu Pro Cys Gly Leu Asn Gly Ala 

110 145 150 155 160 

112 etc tac ttc gtg tec atg gac gcg gat ggt ggc gtg age aag tat ccc 

113 Leu Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro 

114 165 170 175 

116 acc aac acc get ggc gec aag tac ggc acg ggg tac tgt gac age cag 

117 Thr Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gin 

118 180 185 190 

12 0 tgt ccc cgc gat ctg aag ttc ate aat ggc cag gec aac gtt gag ggc 

121 Cys Pro Arg Asp Leu Lys Phe He Asn Gly Gin Ala Asn Val Glu Gly 

122 195 200 205 

124 tgg gag ccg tea tec aac aac gcg aac acg ggc att gga gga cac gga 

125 Trp Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly He Gly Gly His Gly 

126 210 215 220 

12 8 age tgc tgc tct gag atg gat ate tgg gag gec aac tec ate tec gag 
129 Ser Cys Cys Ser Glu Met Asp He Trp Glu Ala Asn Ser He Ser Glu 



48 
96 

144 

192 

240 

288 

336 
384 
432 
480 
528 
576 
624 
672 
720 
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130 


225 








230 






235 










240 




132 


get 


ctt 


ace ccc 


cac 


cct 


tgc 


acg act gtc 


ggc 


cag 


gag 


ate 


tgc 


gag 


768 


133 


Ala 


Leu 


Thr Pro 


His 


Pro 


Cys 


Thr Thr Val 


Gly 


Gin 


Glu 


He 


Cys 


Glu 




134 








245 






250 










255 






136 


ggt 


gat 


ggg tgc 


ggc 


gg a 


act 


tac tec gat 


aac 


aga 


tat 


ggc 


ggc 


act 


816 


137 


Gly Asp 


Gly Cys 


Gly 


Gly 


Thr 


Tyr Ser Asp 


Asn 


Arg 


Tyr 


Gly 


Gly Thr 




138 






260 








265 








270 








140 


tgc 


gat 


ccc gat 


ggc 


tgc 


gac 


tgg aac cca 


tac 


cgc 


ctg 


ggc 


aac 


acc 


864 


141 


Cys Asp 


Pro Asp 


Gly 


Cys 


Asp 


Trp Asn Pro 


Tyr 


Arg 


Leu 


Gly 


Asn 


Thr 




142 






275 








280 






285 










144 


age 


ttc 


tac ggc 


cct 


ggc 


tea 


age ttt acc 


etc 


gat 


acc 


acc 


aag 


aaa 


912 


145 


Ser 


Phe 


Tyr Gly 


Pro 


Gly 


Ser 


Ser Phe Thr 


Leu 


Asp 


Thr 


Thr 


Lys 


Lys 

2 




146 




290 








295 






300 












148 


ttg 


ace 


gtt gtc 


ace 


cag 


ttc 


gag acg teg 


ggt 


gee 


ate 


aac 


cga 


tac 


960 


149 


Leu 


Thr 


Val Val 


Thr 


Gin 


Phe 


Glu Thr Ser 


Gly 


Ala 


He 


Asn 


Ara 


Tvr 




150 


305 








310 






315 










320 




152 


tat 


gtc 


cag aat 


ggc 


gtc 


act 


ttc cag cag 


ccc 


aac 


gee 


gag 


ctt 


qqt 


1008 


153 


Tyr Val 


Gin Asn Gly 


Val 


Thr 


Phe Gin Gin 


Pro 


Asn 


Ala 


Glu 


Leu 


Glv 




154 








325 






330 










335 






156 


agt 


tac 


tct ggc 


aac 


qaq 

ZD Z) 


etc 


aac gat gat 


tac 


tgc 


aca 


get 


qaq 

ZJ^ZD 


qaq 

ZD ZJ 


1056 


157 


Ser 


Tyr 


Ser Gly Asn 


Glu 


Leu 


Asn Asp Asp 


Tyr 


Cys 


Thr 


Ala 


Glu 


Glu 




158 






340 








345 








350 








160 


gca 


gaa 


ttc ggc 


gga 


tec 


tct 


ttc tea gae 


aag 


ggc 


ggc 


ctg 


act 


cag 


1104 


161 


Ala 


Glu 


Phe Gly Gly 


Ser 


Ser 


Phe Ser Asp 


Lys 


Gly 


Gly 


Leu 


Thr 


Gin 




162 






355 








360 






365 










164 


ttc 


aag 


aag get 


ace 


tct 


ggc 


qqc atq qtt 

ZJ ZJ Z) ZJ 


ctg 


gtc 


atg 


agt 


ctg 


tgg 


1152 


165 


Phe 


Lys 


Lys Ala Thr 


Ser 


Gly 


Gly Met Val 


Leu 


Val 


Met 


Ser 


Leu 


Trp 




166 




370 








375 






380 












168 


gat 


gat 


tac tac 


gec 


aac 


atg 


ctg tgg ctg 


gac 


tec 


acc 


tac 


ccg 


aca 


1200 


169 


Asp Asp 


Tyr Tyr Ala 


Asn 


Met 


Leu Trp Leu 


Asp 


Ser 


Thr 


Tyr 


Pro 


Thr 




170 


385 








390 






395 










400 




172 


aac 


gag 


ace tec 


tec 


aca 


ccc 


ggt gee gtg 


cgc 


gga 


age 


tgc 


tec 


acc 


1248 


173 


Asn 


Glu 


Thr Ser 


Ser 


Thr 


Pro 


Gly Ala Val 


Arg 


Gly 


Ser 


Cys 


Ser 


Thr 




174 








405 






410 










415 






176 


age 


tec 


ggt gtc 


cct 


get 


cag 


gtc gaa tct 


cag 


tct 


ccc 


aac 


gee 


aag 


1296 


177 


Ser 


Ser 


Gly Val 


Pro 


Ala 


Gin 


Val Glu Ser 


Gin 


Ser 


Pro 


Asn 


Ala 


Lys 




178 






420 








425 








430 








180 


gtc 


ace 


ttc tec 


aac 


ate 


aag 


ttc gga ccc 


att 


ggc 


age 


acc 


ggc 


aac 


1344 


181 


Val 


Thr 


Phe Ser Asn 


He 


Lys 


Phe Gly Pro 


He 


Gly 


Ser 


Thr 


Gly Asn 




182 






435 








440 






445 










184 


cct 


age 


ggc ggc 


aac 


cct 


ccc 


gg c gga aac 


ccg 


cct 


ggc 


acc 


acc 


acc 


1392 


185 


Pro 


Ser 


Gly Gly Asn 


Pro 


Pro 


Gly Gly Asn 


Pro 


Pro 


Gly 


Thr 


Thr 


Thr 




186 




450 








455 






460 












188 


ace 


cgc 


cgc cca 


gee 


act 


ace 


act gga age 


tct 


ccc 


gga 


cct 


acc 


cag 


1440 


189 


Thr Arg 


Arg Pro 


Ala 


Thr 


Thr 


Thr Gly Ser 


Ser 


Pro 


Gly 


Pro 


Thr 


Gin 




190 


465 








470 






475 










480 




192 


tct 


cac 


tac ggc 


cag 


tgc 


ggc 


ggt att ggc 


tac 


age 


ggc 


ccc 


acg 


gtc 


1488 


193 


Ser 


His 


Tyr Gly Gin 


Cys 


Gly 


Gly He Gly 


Tyr 


Ser 


Gly 


Pro 


Thr 


Val 




194 








485 






490 










495 
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196 


tgc 


gcc 


age 


ggc 


aca 


act 


tgc 


cag 


gtc 


ctg 


aac 


cct 


tac 


tac 


tct 


cag 


197 


Cys Ala Ser Gly Thr Thr Cys 


Gin 


Val 


Leu 


Asn 


Pro 


Tyr 


Tyr 


Ser 


Gin 


198 








500 










505 










510 






200 


tgc 


ctg 


taa 


agctcc 
























201 


Cys 


Leu 






























205 


<210> SEQ ID NO 


: 5 
























206 


<211> LENGTH: 514 
























207 


<212> TYPE: 


PRT 


























208 


<213> ORGANISM: 


Trichoderma 


reesei 
















210 


<400> SEQUENCE: 


5 
























212 


Met 


Tyr 


Arg 


Lys 


Leu 


Ala 


Val 


He 


Ser 


Ala 


Phe 


Leu 


Ala 


Thr 


Ala 


Arg 


213 


1 








5 










10 










15 




216 


Ala 


Gin 


Ser 


Ala 


Cys 


Thr 


Leu 


Gin 


Ser 


Glu 


Thr 


His 


Pro 


Pro 


Leu 


Thr 


217 








20 










25 










30 






220 


Trp 

XT 


Gin 


Lys 


Cys 


Ser 


Ser 


Gly 


Gly 


Thr 


Cys 


Thr 


Gin 


Gin 


Thr 


Gly 


Ser 


221 






35 










40 










45 








224 


Val 


Val 


He 


Asp 


Ala 


Asn 


Trp 


Arg 


Trp 


Thr 


His 


Ala 


Thr 


Asn 


Ser 


Ser 


225 




50 










55 










60 










228 


Thr 


Asn 


Cys 


Tyr 


Asp 


Gly 


Asn 


Thr 


Trp 


Ser 


Ser 


Thr 


Leu 


Cys 


Pro 


Asp 


229 


65 










70 










75 










80 


232 


Asn 


Glu 


Thr 


Cys 


Ala 


Lys 


Asn 


Cys 


Cys 


Leu 


Asp 


Gly 


Ala 


Ala 


Tyr 


Ala 


233 










85 










90 










95 




236 


Ser 


Thr 


Tyr 


Gly 


Val 


Thr 


Thr 


Ser 


Gly 


Asn 


Ser 


Leu 


Ser 


He 


Gly 


Phe 


237 








100 










105 










110 






240 


Val 


Thr 


Gin 


Ser 


Ala 


Gin 


Lys 


Asn 


Val 


Gly 


Ala 


Arq 


Leu 


Tyr 


Leu 


Met 


241 






115 










120 










125 








244 


Ala 


Ser 


Asp 


Thr 


Thr 


Tyr 


Gin 


Glu 


Phe 


Thr 


Leu 


Leu 


Gly 


Asn 


Glu 


Phe 


245 




130 










135 










140 










248 


Ser 


Phe 


Asp 


Val 


Asp 


Val 


Ser 


Gin 


Leu 


Pro 


Cys 


Gly 


Leu 


Asn 


Gly 


Ala 


249 


145 










150 










155 










160 


252 


Leu 


Tyr 


Phe 


Val 


Ser 


Met 


Asp 


Ala 


Asp 


Gly 


Gly 


Val 


Ser 


Lys 


Tyr 


Pro 


253 










165 










170 










175 




256 


Thr 


Asn 


Thr 


Ala 


Gly 


Ala 


Lys 


Tyr 


Gly 


Thr 


Gly 


Tyr 


Cys 


Asp 


Ser 


Gin 


257 








180 










185 










190 






260 


Cys 


Pro 


Arg 


Asp 


Leu 


Lys 


Phe 


He 


Asn 


Gly 


Gin 


Ala 


Asn 


Val 


Glu 


Gly 


261 






195 










200 










205 








264 


Trp 


Glu 


Pro 


Ser 


Ser 


Asn 


Asn 


Ala 


Asn 


Thr 


Gly 


He 


Gly 


Gly 


His 


Gly 


265 




210 










215 










220 










268 


Ser 


Cys 


Cys 


Ser 


Glu 


Met 


Asp 


He 


Trp 


Glu 


Ala 


Asn 


Ser 


He 


Ser 


Glu 


269 


225 










230 










235 










240 


272 


Ala 


Leu 


Thr 


Pro 


His 


Pro 


Cys 


Thr 


Thr 


Val 


Gly 


Gin 


Glu 


He 


Cys 


Glu 


273 










245 










250 










255 




276 


Gly 


Asp 


Gly 


Cys 


Gly 


Gly 


Thr 


Tyr 


Ser 


Asp 


Asn 


Arg 


Tyr 


Gly 


Gly 


Thr 


277 








260 










265 










270 






280 


Cys 


Asp 


Pro 


Asp 


Gly 


Cys 


Asp 


Trp 


Asn 


Pro 


Tyr 


Arg 


Leu 


Gly 


Asn 


Thr 


281 






275 










280 










285 








284 


Ser 


Phe 


Tyr 


Gly 


Pro 


Gly 


Ser 


Ser 


Phe 


Thr 


Leu 


Asp 


Thr 


Thr 


Lys 


Lys 


285 




290 










295 










300 










288 


Leu 


Thr 


Val 


Val 


Thr 


Gin 


Phe 


Glu 


Thr 


Ser 


Gly 


Ala 


He 


Asn 


Arg 


Tyr 
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289 305 310 315 320 

2 92 Tyr Val Gin Asn Gly Val Thr Phe Gin Gin Pro Asn Ala Glu Leu Gly. 
293 325 330 335 

2 96 Ser Tyr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu 
297 340 345 350 

3 00 Ala Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gin 
301 355 360 365 

3 04 Phe Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp 
305 370 375 380 

3 08 Asp Asp Tyr Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr 
309 385 390 395 400 

312 Asn Glu Thr Ser Ser Thr Pro Gly Ala Val Arg Gly Ser Cys Ser Thr 

313 405 410 415 

316 Ser Ser Gly Val Pro Ala Gin Val Glu Ser Gin Ser Pro Asn Ala Lys 

317 420 425 430 

320 Val Thr Phe Ser Asn lie Lys Phe Gly Pro lie Gly Ser Thr Gly Asn 

321 435 440 445 

324 Pro Ser Gly Gly Asn Pro Pro Gly Gly Asn Pro Pro Gly Thr Thr Thr 

325 450 455 460 

328 Thr Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gin 

329 465 470 475 480 

332 Ser His Tyr Gly Gin Cys Gly Gly lie Gly Tyr Ser Gly Pro Thr Val 

333 485 490 495 

336 Cys Ala Ser Gly Thr Thr Cys Gin Val Leu Asn Pro Tyr Tyr Ser Gin 

337 500 505 510 
340 Cys Leu 

344 <210> SEQ ID NO: 6 

345 <211> LENGTH: 45 

346 <212> TYPE: DNA 

347 <213> ORGANISM: Artificial 

349 <220> FEATURE: 

350 <223> OTHER INFORMATION: Primer for PCR 

352 <400> SEQUENCE: 6 

353 cctcccggcg gaaacccgcc tggcaccacc accacccgcc gccca 45 

356 <210> SEQ ID NO: 7 

357 <211> LENGTH: 32 

358 <212> TYPE: DNA 

359 <213> ORGANISM: Artificial 

361 <220> FEATURE: 

362 <223> OTHER INFORMATION: Primer for PCR 

364 <400> SEQUENCE: 7 

365 ggactcacgc tacggccagc agcacgaact gc 32 

368 <210> SEQ ID NO: 8 

369 <211> LENGTH: 36 

370 <212> TYPE: DNA 

371 <213> ORGANISM: Artificial 

373 <22 0> FEATURE: 

374 <223> OTHER INFORMATION: Primer for PCR 
376 <400> SEQUENCE: 8 
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Invalid <213> Response; 

Use of "Artificial" only as "<213> Organism" response is incomplete, 

per 1.823(b) of New Sequence Rules. Valid response is Artificial Sequence. 

Seq#:l # 2,3,6, 7, 8, 9, 10, 11 ,12, 13, 14 ,15, 16, 17,18, 19, 20, 21 ,22,23,24,25,26,27,28,29 
Seq#: 3 0,31, 32, 33, 34, 35, 36, 3 7, 38, 39, 40, 41 ,42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53 
Seq#: 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64 ,65, 66,67,6 8,6 9,70,71,72,73,74,75,76,77 
Seq#: 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96 
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